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GENES ASSOCIATED WITH DISEASES OF THE KIDNEY 

TECHNICAL FIELD 

The invention relates to nine genes associated with diseases of the kidney, as identified by 
5 their coexpression with genes known to be associated with renal disorders. The invention also 
relates to polypeptides encoded by these genes and to the use of these amino acid and nucleic acid 
sequences in the diagnosis, prognosis, prevention, treatment, and evaluation of therapies for diseases 
of the kidney. 

BACKGROUND ART 

10 The kidneys excrete waste and contribute to maintenance of homeostasis by regulating 

blood pH electrolyte levels and blood pressure. Renal failure or dysfunction is the cause of several 
diseases whose clinical manifestations are typically hyper- or hypo-tension. These diseases include 
Banter's syndrome, Gitelman syndrome, autosomal dominant polycystic kidney disease, and 
nephrolithiasis. 

15 Treatment with antibiotics such as gentamycin, tobramycin, or amikacin can lead to tubular 

necrosis with consequent risk of acute renal failure in as much as 25% of patients. A cumulative 
dose of 2 to 3 grams amphotericin B is likewise nephrotoxic. Cyclosporine, an immunosuppressant 
given to organ transplant patients, is also nephrotoxic. The molecular basis for the nephrotoxicity 
of these and other compounds is not understood, although calbindin D levels appear to be a useful 

20 diagnostic marker signaling cyclosporine-induced nephrotoxicity (Aicher et al.( 1 998) 

Electrophoresis 19:1 998-2003). Similarly, reductions in urinary excretion of proteins such as 
Tamm-Horsfall protein, beta 2 microglobin, and urinary retinol binding protein, appear to be 
diagnostic for nephrotoxicity when induced by certain chemotherapeutic agents including ifosfamide 
and cisplatinum (MacLean et al. (1998) Cancer Chemother. Pharmacol. 41:413-416; Tokuc a al. 

25 (1997) J. Exp. Clin. Cancer Res. 16:227-230). 

We have identified nine novel kidney disease-associated genes through their co-expression 
with genes known to affect kidney function in normal and diseased states. The present invention 
satisfies a need in the art by providing new amino acid and nucleic acid compositions that are useful 
for diagnosis, prognosis, treatment, prevention, and evaluation of therapies for renal disorders. 

30 SUMMARY OF THE INVENTION 

In one aspect, the invention provides for a substantially purified polynucleotide comprising 
a gene, that is coexpressed with one or more known kidney disease-associated genes in a plurality of 
biological samples. Preferably* known kidney disease-associated genes are selected from the group 
consisting of uromodulin, NKCC2 (bumetanide-sensitiveNa-K-Cl cotransporter 2), NCCT 
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(thiazine-sensitive Na-Ci cotransporter), aldolase B, ROMK1 (inwardly-rectifying voltage-gated K 
channel). ATP1G1 (Na-K ATPase gamma subunit), PDZK 1 (PDZ domain-containing protein), 
NPT-1 (Na-dependent phosphate cotransporter), calbindin, kininogen, and CIC-Kb (chloride 
channel). Preferred embodiments include (a) a polynucleotide sequence selected from the group 
5 consisting of SEQ ID NO:l , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4. SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9 (SEQ ID NOs:l-9); (b) a polynucleotide 
sequence which encodes a polypeptide sequence selected from the group consisting of SEQ ID 
NO:10, SEQ ID NO:l 1. and SEQ ID NO:12 (SEQ ID NOs:10-12); (c) a polynucleotide sequence 
having at least 75% identity to the polynucleotide sequence of (a) or (b); (d) a polynucleotide 
10 sequence which is complementary to the polynucleotide sequence of (a), (b), or (c); (e) a 

polynucleotide sequence comprising at least 10, preferably at least 18, sequential nucleotides of the 
polynucleotide sequence of (a), (b), (c), or (d); or (f) a polynucleotide which hybridizes under 
stringent conditions to the polynucleotide of (a), (b), (c). (d), or (e). Furthermore, the invention 
provides an expression vector comprising any of the above described polynucleotides, and a host cell 
15 containing the expression vector. Still further, the invention provides a method for treating or 

preventing a disease or condition associated with the altered expression of a gene that is coexpressed 
one or more known kidney disease-associated genes comprising administering to a subject in need a 
polynucleotide described above in an amount effective for treating or preventing the disease. 

In a second aspect, the invention provides a substantially purified polypeptide comprising 
20 the gene product of a gene that is coexpressed with one or more known kidney disease-associated 
genes in a plurality of biological samples. The known kidney disease-associated gene may be 
selected from the group consisting of uromodulin, NKCC2, NCCT, aldolase B, ROMK1, ATP1G1, 
PDZK1, NPT-1 , calbindin, kininogen, and CIC-Kb. Preferred embodiments are (a) the polypeptide 
sequence selected from the group consisting of SEQ ID NOs:10-12; (b) a polypeptide sequence 
25 having at least 85% identity to the polypeptide sequence of (a); and (c) a polypeptide sequence 

comprising at least 6 sequential amino acids of the polypeptide sequence of (a) or (b). Additionally, 
the invention provides antibodies that bind specifically to any of the above described polypeptides 
and a method for treating or preventing a disease or condition associated with the altered expression 
of a gene that is coexpressed with one or more known kidney disease-associated genes comprising 
30 administering to a subject in need such an antibody in an amount effective for treating or preventing 
the disease. 

In another aspect, the invention provides a pharmaceutical composition comprising the 
polynucleotide of claim 2 or the polypeptide of claim 3 in conjunction with a suitable 
pharmaceutical carrier and a method for treating or preventing a disease or condition associated 
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with the altered expression of a gene that is coexpressed with one or more known kidney 
disease-associated genes comprising administering to a subject in need such a composition in an 
amount effective for treating or preventing the disease. 

In a further aspect, the invention provides a method for diagnosing a disease or condition 

5 associated with the altered expression of a gene that is coexpressed with one or more known kidney 
disease-associated genes, wherein each known kidney disease-associated gene is selected from the 
group consisting of uromodulin, NKCC2, NCCT, aldolase 13, ROMK1, ATP1G1, PDZK1, 
NPT-1, calbindin, kininogen, and CIC-Kb. The method comprises the steps of (a) providing a 
sample comprising one or more of the coexpressed genes; (b) hybridizing the polynucleotide of 

10 claim 2 to the coexpressed genes unto conditions effective to form one or more hybridization 
complexes; (c) detecting the hybridization complexes; and (d) comparing the levels of the 
hybridization complexes with the level of hybridization complexes in a nondiseased sample, wherein 
altered levels of one or more of the hybridization complexes in a diseased sample compared with the 
level of hybridization complexes in a non-diseased sample correlates with the presence of the disease 

15 or condition. 

Additionally, the invention provides antibodies, antibody fragments, and immunoconjugates 
that exhibit specificity to any of the above described polypeptides and methods for treating or 
preventing diseases or conditions of the kidney. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 
20 The Sequence Listing provides exemplary kidney disease-associated gene sequences 

including polynucleotide sequences, SEQ ID NOs:l-9, and the polypeptide sequences, SEQ ID 
NOs:10-12. Each sequence is identified by a sequence identification number (SEQ ID NO) and by 
the Incyte clone number with which the sequence was first identified. 

DESCRIPTION OF THE INVENTION 
25 It must be noted that as used herein and in the appended claims, the singular forms "a", 

"an", and "the" include the plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled in 
the art, and so forth. 
30 DEFINITIONS 

"NSEQ" refers generally to a polynucleotide sequence of the present invention, including 
SEQ ID NOs:l-9. "PSEQ" refers generally to a polypeptide sequence of the present invention, 
SEQ ID NOs:10-12. 

A "fragment" refers to a nucleic acid sequence that is preferably at least 20 nucleic acids in 

3 
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length, more preferably 40 nucleic acids, and most preferably 60 nucleic acids in length, and 
encompasses, for example, fragments consisting of nucleic acids 1-50, 51-400, 401-4000, 
4001-12,000, and the like, of SEQ ID NOs:l-9. 

"Gene" refers to the partial or complete coding sequence of a gene and to its 5' or V 
5 untranslated regions. The gene may be in a sense or antisense (complementary) orientation. 

"Kidney disease-associated gene" refers to a gene whose expression pattern is similar to that 
of known kidney disease-associated genes which are useful in the diagnosis, treatment, prognosis, or 
prevention of diseases of the kidney. "Known kidney disease-associated gene" refers to a sequence 
which has been previously identified as useful in the diagnosis, treatment, prognosis, or prevention 
10 of diseases of the kidney. Typically, this means that the known gene is expressed at higher levels 
(i.e., has more abundant transcripts) in diseased or cancerous kidney tissue than in normal or 
non-diseased kidney or in any other tissue. 

"Polynucleotide" refers to a nucleic acid, nucleic acid sequence, oligonucleotide, nucleotide, 
or any fragment thereof. It may be DNA or RNA of genomic or synthetic origin, double-stranded or 
15 single-stranded, and combined with carbohydrate, lipids, protein or other materials to perform a 
particular activity or form a useful compositioa "Oligonucleotide" is substantially equivalent to the 
terms amplimer, primer, oligomer, element, and probe. 

"Polypeptide" refers to an amino acid, amino acid sequence, oligopeptide, peptide, or 
protein or portions thereof whether naturally occurring or synthetic. 
20 A "portion" refers to peptide sequence which is preferably at least 5 to about 15 amino 

acids in length, most preferably at least 10 amino acids long, and which retains some biological or 
immunological activity of, for example, a portion of SEQ ID NOs:10-12. 

"Sample" is used in its broadest sense. A sample containing nucleic acids may comprise a 
bodily fluid; an extract from a cell, chromosome, organelle, or membrane isolated from a cell; 
25 genomic DNA. RNA, or cDNA in solution or bound to a substrate; a cell; a tissue; a tissue print; 
and the like. 

"Substantially purified" refers to a nucleic acid or an amino acid sequence that is removed 
from its natural environment and that is isolated or separated, and is at least about 60% free, 
preferably about 75% free, and most preferably about 90% free, from other components with which 

30 it is naturally present. 

"Substrate" refers to any suitable rigid or semi-rigid support to which polynucleotides or 
polypeptides are bound and includes membranes, filters, chips, slides, wafers, fibers, magnetic or 
nonmagnetic beads, gels, capillaries or other tubing, plates, polymers, and microparticles with a 
variety of surface forms including wells, trenches, pins, channels and pores. 



WO 00/61622 



PCT/US00/08260 



A " variant" refers to a polynucleotide or polypeptide whose sequence diverges from SEQ 
ID NOs: 1-9. Polynucleotide sequence divergence may result from mutational changes such as 
deletions, additions, and substitutions of one or more nucleotides; it may also be introduced to 
accommodate differences in codon usage. Each of these types of changes may occur alone, or in 
5 combination, one or more times in a given sequence. 
THE INVENTION 

The present invention encompasses a method for identifying biomolecules that are 
associated with a specific disease, regulatory pathway, subcellular compartment, cell type, tissue 
type, or species. In particular, the method identifies genes useful in diagnosis, prognosis, treatment, 
10 prevention, and evaluation of therapies for diseases of the kidney including, but not limited, Banter's 
syndrome, Gitelman syndrome, nephrolithiasis, renal amyloidosis, hypertension; primary 
aldosteronism; Addison's disease; renal failure; glomerulonephritis; chronic glomerulonephritis; 
tubulointerstitial nephritis; cystic disorders of the kidney and dysplastic malformations such as 
polycystic disease, renal dysplasias, and cortical or medullary cysts; inherited polycystic renal 
15 diseases (PRD), such as recessive and autosomal dominant PRD; medullary cystic disease; 

medullary sponge kidney and tubular dysplasia; Alport's syndrome; non-renal cancers which affect 
renal physiology, such as bronchogenic tumors of the lungs or tumors of the basal region of the 
brain; multiple myeloma; adenocarcinomas of the kidney; metastatic renal carcinoma; in addition, 
nephrotoxic disorders include any functional or morphologic change in the kidney produced by any 
20 pharmaceutical, chemical, or biological agent that is ingested, injected, inhaled, or absorbed. Some 
broad categories of common nephrotoxic agents are heavy metals, all classes of antibiotics, 
analgesics, solvents, oxalosis-inducing agents, anticancer drugs, herbicides and pesticides, 
botanicals and biologicals, and antiepileptics. 

The method entails first identifying polynucleotides that are expressed in a plurality of 
25 cDNA libraries. The identified polynucleotides include genes of known or unknown function which 
are expressed in a specific disease process, subcellular compartment, cell type, tissue type, or 
species. The expression patterns of the genes with known function are compared with those of 
genes with unknown function to determine whether a specified coexpression probability threshold is 
met. Through this comparison, a subset of the polynucleotides having a high coexpression 
30 probability with the known genes can be identified. The high coexpfessioh probability correlates 
with a particular coexpression probability threshold which is preferably less than 0.001 and more 
preferably less than 0.00001 . 

The polynucleotides originate from cDN A libraries derived from a variety of sources 
including, but not limited to, eukaryotes such as human, mouse, rat, dog, monkey, plant, and yeast; 
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prokaryotes such as bacteria; and viruses. These polynucleotides can also be selected from a variety 
of sequence types including, but not limited to, expressed sequence tags (ESTs), assembled 
polynucleotide sequences, full length gene coding regions, promoters, introns, enhancers, 5* 
untranslated regions, and 3' untranslated regions. To have statistically significant analytical results, 

5 the polynucleotides need to be expressed in at least three cDNA libraries. 

The cDNA libraries used in the coexpression analysis of the present invention can be 
obtained from adrenal gland, biliary tract, bladder, blood cells, blood vessels, bone marrow, brain, 
bronchus, cartilage, chromaffin system, colon, connective tissue, cultured cells, embryonic stem 
cells, endocrine glands, epithelium, esophagus, fetus, ganglia, heart, hypothalamus, immune system, 

10 intestine, islets of Langerhans, kidney, larynx, liver, lung, lymph, muscles, neurons, ovary, 
pancreas, penis, peripheral nervous system, phagocytes, pituitary, placenta, pleurus, prostate, 
salivary glands, seminal vesicles, 

skeleton, spleen, stomach, testis, thymus, tongue, ureter, uterus, and the like. The number of cDNA 
libraries selected can range from as few as 3 to greater than 10,000. Preferably, the number of the 

15 cDNA libraries is greater than 500. 

In a preferred embodiment, genes are assembled from related sequences, such as assembled 
sequence fragments derived from a single transcript. Assembly of the polynucleotide sequences can 
be performed using sequences of various types including, but not limited to, ESTs, extensions, or 
shotgun sequences. In a most preferred embodiment, the polynucleotide sequences are derived from 

20 human sequences that have been assembled using the algorithm disclosed in "Database and System 
for Storing, Comparing and Displaying Related Biomolecular Sequence Information", Lincoln a al 
USSN 09/276,534, filed March 25, 1999, incorporated herein by reference. 

Experimentally, differential expression of the polynucleotides can be evaluated by methods 
including, but not limited to, differential display by spatial immobilization or by gel electrophoresis, 

25 genome mismatch scanning, representational difference analysis, and transcript imaging. 

Additionally, differential expression can be assessed by microarray technology. These methods may 
be used alone or in combination. 

Known kidney disease-associated genes can be selected based on the use of these genes as 
diagnostic or prognostic markers or as therapeutic targets. Preferably, the known kidney disease- 

30 associated genes include uromodulin, NKCC2, NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, 
NPT-1, calbindin, kininogen, and CIC-Kb, and the like. 

The procedure for identifying novel genes that exhibit a statistically significant coexpression 
pattern with known kidney disease-associated genes is as follows. First, the presence or absence of 
a gene in a cDNA library is defined: a gene is present in a cDNA library when at least one cDNA 
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fragment corresponding to that gene is detected in a cDNA sample taken from the library, and a 
gene is absent from a library when no corresponding cDNA fragment is detected in the sample. 

Second, the significance of gene coexpression is evaluated using a probability method to 
measure a due-to-chance probability of the coexpression. The probability method can be the Fisher 

5 exact test, the chi-squared test, or the kappa test. These tests and examples of their applications are 
well known in the art and can be found in standard statistics texts (Agresti (1990) Categorical Data 
Analysis . John Wiley & Sons, New York NY; Rice (1988) Mathematical Statistics and Data 
Analysis . Duxbury Press, Pacific Grove CA). A Bonferroni correction (Rice, supra , p. 384) can 
also be applied in combination with one of the probability methods for correcting statistical results 

10 of one gene versus multiple other genes. In a preferred embodiment, the due-to-chance probability is 
measured by a Fisher exact test, and the threshold of the due-to-chance probability is set preferably 
to less than 0.001 , more preferably to less than 0.00001 . 

To determine whether two genes, A and B, have similar coexpression patterns, occurrence 
data vectors can be generated as illustrated in Table 1. The presence of a gene occurring at least 

15 once in a library is indicated by a one, and its absence from the library, by a zero. 



Table 1. Occurrence data for genes A and B 





Library 1 


Library 2 


Library 3 




Library N ; 


gene A 


1 


1 


0 




0 


geneB 


1 


0 


1 




0 



20 

For a given pair of genes, the occurrence data in Table 1 can be summarized in a 2 x 2 contingency 
table. 

Table 2. Contingency table for co-occurrences of genes A and B 





Gene A present 


Gene A absent 


Total 


Gene B present 


8 


2 


10 


Gene B absent 


2 


18 


20 


Total 


10 


20 


30 



Table 2 presents co-occurrence data for gene A and gene B in a total of 30 libraries. Both 
30 gene A and gene B occur 10 times in the libraries. Table 2 summarizes and presents: 1) the number 
of times gene A and B are both present in a library; 2) the number of times gene A and B are both 
absent in a library; 3) the number of times gene A is present, and gene B is absent; and 4) the 
number of times gene B is present, and gene A is absent. The upper left entry is the number of 
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times the two genes co-occur in a library, and the middle right entry is the number of times neither 
gene occurs in a library. The off diagonal entries are the number of times one gene occurs, and the 
other does not. Both A and B are present eight times and absent 18 times. Gene A is present, and 
gene B is absent, two times; and gene B is present, and gene A is absent, two times. The probability 
5 ("p-value") that the above association occurs due to chance as calculated using a Fisher exact test is 
0.0003. Associations are generally considered significant if a p-value is less than 0.01 (Agresti, 
supra ; Rice, supra - ) . 

This method of estimating the probability for coexpression of two genes makes several 
assumptions. The method assumes that the libraries are independent and are identically sampled. 
10 However, in practical situations, the selected cDNA libraries are not entirely independent, because 
more than one library may be obtained from a single subject or tissue. Nor are they entirely 
identically sampled, because different numbers of cDNAs may be sequenced from each library. The 
number of cDNAs sequenced typically ranges from 5,000 to 10,000 cDNAs per library. In 
addition, because a Fisher exact coexpression probability is calculated for each gene versus 41,419 
15 other assembled genes, a Bonferroni correction for multiple statistical tests is necessary. 

Using the method of the present invention, we have identified nine novel genes that exhibit 
strong association, or coexpression, with known genes that are specific to kidney disease. These 
knownkidney disease-associated genes include uromodulin,NKCC2, NCCT, aldolaseB, ROMK1, 
ATP1G1, PDZK1, NPT-1, calbindin, kininogen, andCIC-Kb. The results presented in Table 4 
20 show that the expression of the nine novel genes have direct or indirect association with the 

expression of known kidney disease-associated genes. Therefore, the novel genes can potentially be 
used in diagnosis, treatment, prognosis, or prevention of diseases of the kidney or in the evaluation 
of therapies for diseases of the kidney. Further, the gene products of the nine novel genes are either 
potential therapeutic proteins or targets of therapeutics against diseases of the kidney. 
25 Therefore, in one embodiment, the present invention encompasses a polynucleotide sequence 

comprising the sequence of SEQ ID Nos:l-9. These nine polynucleotides are shown by the method 
of the present invention to have strong coexpression association with known kidney 
disease-associated genes and with each other. The invention also encompasses a variant of the 
polynucleotide sequence, its complement, or 18 consecutive nucleotides of a sequence provided in 
30 the above described sequences. Variant polynucleotide sequences typically have at least about 75%, 
more preferably at least about 85%, and most preferably at least about 95% polynucleotide 

sequence identity to NSEQ. 

NSEQ or the encoded PSEQ may be used to search against the GenBank primate (pri), 
rodent (rod), mammalian (mam), vertebrate (vrtp), and eukaryote (eukp) databases, SwissProt, 
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BLOCKS (Bairoch et al. (1997) Nucleic Acids Res. 25:217-221), PFAM, and other databases that 
contain previously identified and annotated motifs, sequences, and gene functions. Methods that 
search for primary sequence patterns with secondary structure gap penalties (Smith et al. (1992) 
Protein Engineering 5:35-51) as well as algorithms such as Basic Local Alignment Search Tool 

5 (BLAST; Altschul (1993) J. Mol. Evol. 36:290-300; Altschul et al. (1990) J. Mol. Biol. 

215:403-410), BLOCKS (Henikoff and Henikoff (1991) Nucleic Acids Res. 19:6565-6572), Hidden 
Markov Models (HMM; Eddy (1996) Cur. Opia Str. Biol. 6:361-365; Sonnhammer et al. (1997) 
Proteins 28:405-420), and the like, can be used to manipulate and analyze nucleotide and amino acid 
sequences. These databases, algorithms and other methods are well known in the art and are 

10 described in Ausubel et al. (1997; Short Protocols in Molecular Biology, John Wiley & Sons. New 
York NY, unit 7.7) and Meyers (1995; Modular Biology and Biotechnology, Wiley VCH, New 

York NY, pp. 856-853). 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to SEQ ID NOs:l-9, and fragments thereof under stringent conditions. Stringent 
15 conditions can be defined by salt concentration, temperature, and other chemicals and conditions 
well known in the art. Suitable conditions can be selected, for example, by varying the 
concentrations of salt in the prehybridization, hybridization, and wash solutions or by varying the 
hybridization and wash temperatures. With some substrates, the temperature can be decreased by 
adding formamide to the prehybridization and hybridization solutions. 
20 Hybridization can be performed at low stringency, with buffers such as 5xSSC with 1 % 

sodium dodecyl sulfate (SDS) at 60°C, which permits complex formation between two nucleic acid 
sequences that contain some mismatches. Subsequent washes are performed at higher stringency 
with buffers such as 0.2xSSC with 0. 1% SDS at either 45°C (medium stringency) or 68°C (high 
stringency), to maintain hybridization of only those complexes that contain completely 
25 complementary sequences. Background signals can be reduced by the use of detergents such as 
SDS, Sarcosyl, or Triton X-100, and/or a blocking agent, such as salmon sperm DNA. 
Hybridization methods are described in detail in Ausubel (supra, units 2.8-2. 1 1 . 3. 18-3. 19 and 
4.64.9) and Sambrook et al. (1989; MnWuinr Cloning. A Laboratory Manual, Cold Spring Harbor 
Press, Plainview NY). 

30 NSEQ can be extended utilizing a partial nucleotide sequence and employing various 

PCR-based methods known in the art to detect upstream sequences such as promoters and other 
regulatory elements. (See. e.g.. Dieffenbach and Dveksler (1995) PCR Primer, a Laboratory 
Manual. Cold Spring Harbor Press, Plainview NY.) Additionally, one may use an XL-PCR kit (PE 
Biosystems, Foster City, CA), nested primers, and commercially available cDNA libraries (Life 
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Technologies, Rockville MD) or genomic libraries (Clontech, Palo Alto CA) to extend the sequence. 
For all PCR-based methods, primers may be designed using commercially available software, such 
as OLIGO 4.06 Prima- Analysis software (National Biosciences, Plymouth MN) or another 
appropriate program, to be about 18 to 30 nucleotides in length, to have a GC content of about 

5 50%, and to form a hybridization complex at temperatures of about 68°C to 72°C. 

In another aspect of the invention, NSEQ can be cloned in recombinant DN A molecules that 
direct the expression of PSEQ, or structural or functional portions thereof, in appropriate host cells. 
Due to the inherent degeneracy of the genetic code, other DN A sequences which encode 
substantially the same or a functionally equivalent amino acid sequence may be produced and used 

1 0 to express the polypeptide encoded by NSEQ. The nucleotide sequences of the present invention 
can be engineered using methods generally known in the art in order to alter the nucleotide 
sequences for a variety of purposes including, but not limited to, modification of the cloning, 
processing, and/or expression of the gene product. DN A shuffling by random fragmentation and 
PCR reassembly of gene fragments and synthetic oligonucleotides may be used to engineer the 

15 nucleotide sequences. For example, oligonucleotide-mediated site-directed mutagenesis may be used 
to introduce mutations that create new restriction sites, alter glycosylation patterns, change codon 
preference, produce splice variants, and so forth. 

In order to express a biologically active protein, NSEQ, or derivatives thereof, may be 
inserted into an appropriate expression vector, i.e., a vector which contains the necessary elements 

20 for transcriptional and translational control of the inserted coding sequence in a particular host. 
These elements include regulatory sequences, such as enhancers, constitutive and inducible 
promoters, and 5 ' and 3 ' untranslated regions. Methods which are well known to those skilled in the 
art may be used construct such expression vectors. These methods include in vitro recombinant 
DNA techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, 

25 supra ; and Ausubel, supra .) 

A variety of expression vector/host cell systems may be utilized to express NSEQ. These 
include, but are not limited to, microorganisms such as bacteria transformed with recombinant 
bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with yeast 
expression vectors; insect cell systems infected with baculovirus vectors; plant cell systems 

30 transformed with viral or bacterial expression vectors; or animal cell systems. For long term 
production of recombinant proteins in mammalian systems, stable expression in cell lines is 
preferred. For example, NSEQ can be transformed into cell lines using expression vectors which 
may contain viral origins of replication and/or endogenous expression elements and a selectable or 
visible marker gene on the same or on a separate vector. The invention is not to be limited by the 
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vector or host cell employed. 

In general, host cells that contain NSEQ and that express PSEQ may be identified by a 
variety of procedures known to those of skill in the art. These procedures include, but are not 
limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and protein bioassay or 

5 immunoassay techniques which include membrane, solution, or chip based technologies for the 
detection and/or quantification of nucleic acid or protein sequences. Immunological methods for 
detecting and measuring the expression of PSEQ using either specific polyclonal or monoclonal 
antibodies are known in the art. Examples of such techniques include enzyme-linked 
immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell 

10 sorting (FACS). 

Host cells transformed with NSEQ may be cultured under conditions suitable for the 
expression and recovery of the protein from cell culture. The protein produced by a transgenic cell 
may be secreted or retained intracellularly depending on the sequence and/or the vector used. As 
will be understood by those of skill in the art, expression vectors containing NSEQ may be designed 

15 to contain signal sequences which direct secretion of the protein through a prokaryotic or eukaryotic 
cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylate, 

20 phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" 
form of the protein may also be used to specify protein targeting, folding, and/or activity. Different 
host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and W138) are available from the 
American Type Culture Collection (Manassas MD) and may be chosen to ensure the correct 

25 modification and processing of the expressed protein. 

In another embodiment of the invention, natural, modified or recombinant nucleic acid 
sequences are ligated to a heterologous sequence resulting in translation of a fusion protein 
containing heterologous protein moieties in any of the aforementioned host systems. Such 
heterologous protein moieties facilitate purification of fusion proteins using commercially available 

30 affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase, maltose 
binding protein, thioredoxin, calmodulin binding peptide, 6-His, FLAG, c-myc, hemaglutinin, and 
monoclonal antibody epitopes. 

In another embodiment, the nucleic acid sequences are synthesized, in whole or in part, 
using chemical or enzymatic methods well known in the art (Caruthers et al (1980) Nucleic Acids 



11 



WO 00/61622 



PCT/US00/08260 



Symp. Ser. (7) 215-233; Ausubel, supra ). For example, peptide synthesis can be performed using 
various solid-phase techniques (Roberge et al. (1995) Science 269:202-204), and machines such as 
the ABI 431 A Peptide synthesizer (PE Biosystems) can be used to automate synthesis. If desired, 
the amino acid sequence may be altered during synthesis and/or combined with sequences from other 
5 proteins to produce a variant protein. 

In another embodiment, the invention entails a substantially purified polypeptide comprising 
the amino acid sequence selected from the group consisting of SEQ ID NOs:10-12 and fragments 
thereof. 

DIAGNOSTICS and THERAPEUTICS 

10 The polynucleotide sequences can be used in diagnosis, prognosis, treatment, prevention, 

and evaluation of therapies for diseases of the kidney including, but not limited, Bartter's syndrome, 
Gitelman syndrome, nephrolithiasis, renal amyloidosis, hypertension; primary aldosteronism; 
Addison's disease; renal failure; glomerulonephritis; chronic glomerulonephritis; tubulointerstitial 
nephritis; cystic disorders of the kidney and dysplastic malformations such as polycystic disease, 

15 renal dysplasias, and cortical or medullary cysts; inherited polycystic renal diseases (PRD), such as 
recessive and autosomal dominant PRD; medullary cystic disease; medullary sponge kidney and 
tubular dysplasia; Alport's syndrome; non-renal. cancers which affect renal physiology, such as 
bronchogenic tumors of the lungs or tumors of the basal region of the brain; multiple myeloma; 
adenocarcinomas of the kidney; metastatic renal carcinoma; in addition, nephrotoxic disorders 

20 include any functional or morphologic change in the kidney produced by any pharmaceutical, 
chemical, or biological agent that is ingested, injected, inhaled, or absorbed. 

In one preferred embodiment, the polynucleotide sequences are used for diagnostic purposes 
to determine the absence, presence, and excess expression of the proteia The polynucleotides may 
be at least 18 nucleotides long and consist of complementary RNA and DNA molecules, branched 

25 nucleic acids, and/or peptide nucleic acids (PNAs). In one alternative, the polynucleotides are used 
to detect and quantify gene expression in samples in which expression of NSEQ is correlated with 
disease. In another alternative, NSEQ can be used to detect genetic polymorphisms associated with 
a disease. These polymorphisms may be detected in the transcript cDNA. 

The specificity of the probe is determined by whether it is made from a unique region, a 

30 regulatory region, or from a conserved motif. Both probe specificity and the stringency of 
diagnostic hybridization or amplification (maximal, high, intermediate, or low) will determine 
whether the probe identifies only naturally occurring, exactly complementary sequences, allelic 
variants, or related sequences. Probes designed to detect related sequences should preferably have 
at least 75% sequence identity to any of the nucleic acid sequences encoding PSEQ. 
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Methods for producing hybridization probes include the cloning of nucleic acid sequences 
into vectors for the production of mRNA probes. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by adding appropriate 
RNA polymerases and labeled nucleotides. Hybridization probes may incorporate nucleotides 
5 labeled by a variety of reporter groups including, but not limited to, radionuclides such as 32 P or 35 S, 
enzymatic labels such as alkaline phosphatase coupled to the probe via avidin/biotin coupling 
systems, fluorescent labels, and the like. The labeled polynucleotide sequences may be used in 
Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; and in microarrays utilizing samples from subjects to detect altered NSEQ expression. 
10 NSEQ can be labeled by standard methods and added to a sample from a subject under 

conditions suitable for the formation and detection of hybridization complexes. After incubation the 
sample is washed, and the signal associated with hybrid complex formation is quantitated and 
compared with a standard value. Standard values are derived from any control sample, typically 
one that is free of the suspect disease. If the amount of signal in the subject sample is altered in 
15 comparison to the standard value, then the presence of altered levels of expression in the sample 
indicates the presence of the disease. Qualitative and quantitative methods for comparing the 
hybridization complexes formed insubject samples with previously established standards are well 
known in the art. 

Such assays may also be used to evaluate the efficacy of a particular therapeutic treatment 
20 regimen in animal studies, in clinical trials, or to monitor the treatment of an individual subject. 
Once the presence of disease is established and a treatment protocol is initiated, hybridization or 
amplification assays can be repeated on a regular basis to determine if the level of expression in the 
subject begins to approximate that which is observed in a healthy subject. The results obtained 
from successive assays may be used to show the efficacy of treatment over a period ranging from 

25 several days to many years. 

The polynucleotides may be used for the diagnosis of a variety of diseases associated with 

the 

kidney. These include, but are not limited to, Banter's syndrome, Gitelman syndrome, 
nephrolithiasis, renal amyloidosis, hypertension; primary aldosteronism; Addison's disease; renal 
30 failure; glomerulonephritis; chronic glomerulonephritis; tubulointerstitial nephritis; cystic disorders 
of the kidney and dysplastic malformations such as polycystic disease, renal dysplasias, and cortical 
or medullary cysts; inherited polycystic renal diseases (PRD), such as recessive and autosomal 
dominant PRD; medullary cystic disease; medullary sponge kidney and tubular dysplasia; Alport's 
syndrome; non-renal cancers which affect renal physiology, such as bronchogenic tumors of the 
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lungs or tumors of the basal region of the brain; multiple myeloma; adenocarcinomas of the kidney; 
metastatic renal carcinoma; in addition, nephrotoxic disorders include any functional or morphologic 
change in the kidney produced by any pharmaceutical, chemical, or biological agent that is ingested, 

injected, inhaled, or absorbed. 
5 The polynucleotides may also be used as targets in a microarray. The microarray can be 

used to monitor the expression patterns of large numbers of genes simultaneously and to identify 
splice variants, mutations, and polymorphisms. Information derived from analyses of the expression 
patterns may be used to determine gene function, to understand the genetic basis of a disease, to 
diagnose a disease, and to develop and monitor the activities of therapeutic agents used to treat a 
10 disease. Microarrays may also be used to detect genetic diversity, single nucleotide polymorphisms 
which may characterize a particular population, at the genome level. 

In yet another alternative, polynucleotides may be used to generate hybridization probes 
useful in mapping the naturally occurring genomic sequence. Fluorescent in situ hybridization may 
be correlated with other physical chromosome mapping techniques and genetic map data as 
15 described in Heinz-Ulrich et al. (In: Meyers; supra, pp. 965-968). 

In another embodiment, antibodies or antibody fragments comprising an antigen binding site 
that specifically binds PSEQ may be used for the diagnosis of diseases characterized by the 
over-or-under expression of PSEQ. A variety of protocols for measuring PSEQ, including ELISAs, 
RIAs, and FACS, are well known in the art and provide a basis for diagnosing altered or abnormal 
20 levels of expression. Standard values for PSEQ expression are established by combining samples 
taken from healthy subjects, preferably human, with antibody to PSEQ under conditions suitable for 
complex formation. The amount of complex formation may be quantitated by various methods, 
preferably by photometric means. Quantities of PSEQ expressed in disease samples are compared 
with standard values. Deviation between standard and subject values establishes the parameters for 
25 diagnosing or monitoring disease. Alternatively, one may use competitive drug screening assays in 
which neutralizing antibodies capable of binding PSEQ specifically compete with a test compound 
for binding the protein. Antibodies can be used to detect the presence of any peptide which shares 
one or more antigenic determinants with PSEQ. In one aspect, the anti-PSEQ antibodies of the 
present invention can be used for treatment or monitoring therapeutic treatment for diseases of the 
30 kidney. 

In another aspect, the NSEQ, or its complement, may be used therapeutically for the 
purpose of expressing mRN A and protein, or conversely to block transcription or translation of the 
mRNA. Expression vectors may be constructed using elements from retroviruses, adenoviruses, 
herpes or vaccinia viruses, or bacterial plasmids, and the like. These vectors may be used for 
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delivery of nucleotide sequences to a particular target organ, tissue, or cell population. Methods 
well known to those skilled in the art can be used to construct vectors to express nucleic acid 
sequences or their complements. (See, e.g.. Mauliket al. (1997) Molecular Biotechnology, 
TMapaotic Annlicatk™ and Strategies. Wiley-Liss, New York NY.) Alternatively, NSEQ, or its 
5 complement,maybeusedforsomaticcellorstemcellgenetherapy. Vectors may be introduced in 
vivo, in vitro, and ex vivo. For ex yjyo therapy, vectors are introduced into stem cells taken from 
the subject, and the resulting transgenic cells are clonally propagated for autologous transplant back 
into that same subject. Delivery of NSEQ by transfection, liposome injections, or polycationic 
amino polymers may be achieved using methods which are well known in the art. (See, e.g., 
10 Goldman et al. (1997) Nature Biotechnology 15:462-466.) Additionally, endogenous NSEQ 
expression may be inactivated using homologous recombination methods which insert an inactive 
gene sequence into the coding region or other appropriate targeted region of NSEQ. (See, e.g. 
Thomas et al. (1987) Cell 51: 503-512.) 

Vectors containing NSEQ can be transformed into a cell or tissue to express a missing 
15 protein or to replace a nonfunctional protein. Similarly a vector constructed to express the 
complement of NSEQ can be transformed into a cell to downregulate the overexpression of PSEQ. 
Complementary or antisense sequences may consist of an oligonucleotide derived from the 
transcription initiation site; nucleotides between about positions -10 and +10 from the ATG are 
preferred. Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple 
20 helix pairing is useful because it causes inhibition of the ability of the double helix to open 

sufficiently for the binding of polymerases, transcription factors, or regulatory molecules. Recent 
therapeutic advances using triplex DNAhave been described in the literature. (See, e.g., Gee et al. 
In: Huber and Carr (1994) Mnlrrnlar andlmm-nmnric Anoroaches, Futura Publishing, Mt. Kisco 
NY, pp. 163-177.) 

25 Ribozymes, enzymatic RN A molecules, may also be used to catalyze the cleavage of 

mRN A and decrease the levels of particular mRNAs, such as those comprising the polynucleotide 
sequences of the invention. (See, e.g., Rossi (1994) Current Biology 4: 469-471.) Ribozymes may 
cleave mRNA at specific cleavage sites. Alternatively, ribozymes may cleave mRNAs at locations 
dictated by flanking regions that form complementary base pairs with the target mRNA. The 

30 construction and production of ribozymes is well known in the art and is described in Meyers 
(supra) . 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to. the addition of flanking sequences at the 5' and/or 3' 
ends of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiester 
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linkages within the backbone of the molecule. Alternatively, nontraditional bases such as inosine, 
queosine. and wybutosine, as well as acetyl, methyl-, thio-, and similarly modified forms of 
adenine, cytidine, guanine, thymine, and uridine, which are not as easily recognized by endogenous 
endonucleases, may be included. 
5 Further, an antagonist, or an antibody that binds specifically to PSEQ may be administered 

to a subject to treat or prevent a disease of the kidney. The antagonist, antibody, or fragment may 
be used directly to inhibit the activity of the protein or indirectly to deliver a therapeutic agent to 
cells or tissues which express the PSEQ. An immunoconjugate comprising a PSEQ binding site of 
the antibody or the antagonist and a therapeutic agent may be administered to a subject in need to 
10 treat or prevent disease. The therapeutic agent may be a cytotoxic agent selected from a group 
including, but not limited to, abrin, ricin, doxorubicin, daunorubicin, taxol, ethidiurn bromide, 
mitomycin, etoposide, tenoposide. vincristine, vinblastine, colchicine, dihydroxy anthracin dione, 
actinomycin D, diphteria toxin, Pseudomonas exotoxin A and 40, radioisotopes, and glucocorticoid. 
Antibodies to PSEQ may be generated using methods that are well known in the art. Such 
15 antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain 
antibodies, Fab fragments, and fragments produced by a Fab expression library. Neutralizing 
antibodies, such as those which inhibit dimer formation, are especially preferred for therapeutic use. 
Monoclonal antibodies to PSEQ may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not 
20 limited to, the hybridoma. the human B-cell hybridoma, and the EBV-hybridoma techniques. In 
addition, techniques developed for the production of chimeric antibodies can be used. (See, e.g., 
Pound (1998) ImaunocMca] Protocols. Methods Mol. Biol. Vol. 80.) Alternatively, techniques 
described for the production of single chain antibodies may be employed. Antibody fragments 
which contain specific binding sites for PSEQ may also be generated. Various immunoassays may 
25 beusedtoidentifyantibodieshavingthedesiredspecificity. Numerous protocols for competitive 
binding or immunoradiometric assays using either polyclonal or monoclonal antibodies with 
established specificities are well known in the art. 

Yet further, an agonist of PSEQ may be administered to a subject to treat or prevent a 
disease associated with decreased expression, longevity or activity of PSEQ. 
30 An additional aspect of the invention relates to the administration of a pharmaceutical or 

sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the 
therapeutic applications discussed above. Such pharmaceutical compositions may consist of PSEQ 
or antibodies, mimetics, agonists, antagonists, or inhibitors of the polypeptide. The compositions 
may be administered alone or in combination with at least one other agent, such as a stabilizing 
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compound, which may be administered in any sterile, biocompatible pharmaceutical carrier 
including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be 
administered to a subject alone or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
5 number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain 
suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which facilitate 
10 processing of the active compounds into preparations which can be used pharmaceutical^. Further 
details on techniques for formulation and administration may be found in the latest edition of 
Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays or in animal models such as mice, rats, rabbits, dogs, or pigs. An animal model may 
1 5 also be used to determine the appropriate concentration range and route of administration. Such 
information can then be used to determine useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient which ameliorates 
the symptoms or condition. Therapeutic efficacy and toxicity may be determined by standard 
pharmaceutical procedures in cell cultures or with experimental animals, such as by calculating and 
20 contrasting the ED 50 (the dose therapeutically effective in 50% of the population) and LD 50 (the dose 
lethal to 50% of the population) statistics, Any of the therapeutic compositions described above 
may be applied to any subject in need of such therapy, including, but not limited to, mammals such 
as dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans. 

EXAMPLES 

25 It is to be understood that this invention is not limited to the particular devices, machines, 

materials and methods described. Although particular embodiments are described, equivalent 
embodiments may be used to practice the invention. The described embodiments are not intended to 
limit the scope of the invention which is limited only by the appended claims. The examples below 
are provided to illustrate the subject invention and are not included for the purpose of limiting the 

30 invention, 

I cDNA Library Construction 

The KIDNNOT 19 library, in which Incyte clones 2673009 and 2675922 were discovered, 
was constructed from RNA isolated from kidney tissue removed from a 65-year-old male during an 
exploratory laparotomy and nephroureterectomy. Pathology for the associated tumor tissue 
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indicated a grade 1 renal cell carcinoma, clear cell type, forming a variegated mass situated within 
the upper pole of the left kidney. Patient presented with abdominal pain. Padent history included 
benign hypertension and medications included verapamil, lisinopril, and aspirin. The frozen tissue 
was homogenized and lysed in TRIZOL reagent (10 g/10 ml; Life Technologies), using a 
5 POLYTRON homogenizer (PT-3000; Brinkmann Instruments, Westbury NY). Following 

homogenization, chloroform was added to the homogenate (at a rado of 1 :5 chloroform:homogenate, 
v/v), and the phases separated by centrifugation. The aqueous phase was removed to a fresh tube 
and mixed with isopropanol. Precipitated RNA was resuspended in DEPC-treated water and treated 
with DNAse for 25 minutes at 37°C. The RNA was re-extracted with acid phenol-chloroform, pH 
10 4.7, and precipitated with 0.3M sodium acetate and 2.5 volumes 100% ethanol. The mRNA was 
isolated with the OLIGOTEX kit (Qiagen, Valencia CA) and used to construct the cDNA library. 

The mRNA was handled according to the recommended protocols in the SUPERSCRIPT 
plasmid system (Life Technologies). The cDNAs were fracdonated on a SEPH AROSE CL413 
column (Amersham Pharmacia Biotech, Piscataway NJ), and those cDNAs exceeding 400 bp were 
15 ligated into pINCY plasmid (Incyte Pharmaceuticals, Palo Alto CA). The plasmid was 
subsequendy transformed into DH5a competent cells (Life Technologies). 
II Isolation and Sequencing of cDNA Clones 

Plasmid DNA was released from the cells and purified using the REAL PREP 96 plasmid 
kit (Qiagen). This kit enabled the simultaneous purification of 96 samples in a 96-well block using 
20 multi-channel reagent dispensers. The recommended protocol was employed except for the 

following changes: 1) the bacteria were cultured in 1 ml of sterile Terrific Broth (Life Technologies) 
with carbemcillin at 25 mg/1 and glycerol at 0.4%; 2) after inoculation, the cultures were incubated 
for 19 hours; at the end of incubation, the cells were lysed with 0.3 ml of lysis buffer; and 3) 
following isopropanol precipitation, the plasmid DNA pellet was resuspended in 0. 1 ml of distilled 
25 water, and samples were transferred to a 96-well block for storage at 4°C. 

The cDNAs were prepared using a MICROLAB 2200 (Hamilton, Reno NV) in 
combination with DNA ENGINE thermal cycler (PTC200; MJ Research, Watertown MA). cDNAs 
were sequenced by the method of Sanger et al. (1975, J. Mol. Biol. 94:4410 using ABI PRISM 377 
DNA sequencing systems (PE Biosystems) or MEGABASE 1000 sequencing systems (Amersham 

30 Pharmacia Biotech). 

Most of the sequences disclosed herein were sequenced using standard ABI protocols and 
kits (PE Biosystems). The solution volumes were used at 0.25x -1 .Ox concentrations. Some of the 
sequences disclosed herein were sequenced using solutions and dyes from Amersham Pharmacia 
Biotech 
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III Selection, Assembly, and Characterization of Sequences 

The sequences used for coexpression analysis were assembled from EST sequences, 5 r and 
3* longread sequences, and full length coding sequences. Selected assembled sequences were 
expressed in at least three cDNA libraries. 

5 The assembly process is described as follows. EST sequence chromatograms were 

processed and verified. Quality scores were obtained using PHRED (Ewing et al. (1998) Genome 
Res. 8:175-185; Ewing and Green (1998) Genome Res. 8:186-194), and edited sequences were 
loaded into a relational database management system (RDBMS). The sequences were clustered 
using BLAST with a product score of 50. All clusters of two or more sequences created a bin, and 

10 each bin with its resident sequences represents one transcribed gene. 

Assembly of the component sequences within each bin was performed using a modification 
of PHRAP, a publicly available program for assembling DNA fragments (Green, P. University of 
Washington Seattle WA). Bins that showed 82% identity from a local pair-wise alignment between 
any of the consensus sequences were merged, 

15 Bins were annotated by screening the consensus sequence in each bin against public 

databases, such as GBpri and GenPept from NCBI. The annotation process involved a FASTn 
screen against the GenBank GBpri database. Those hits with a percent identity of greater than or 
equal to 75% and an alignment length of greater than or equal to 100 base pairs were recorded as 
homolog hits. The residual unannotated sequences were screened by FASTx against GenPept. 

20 Those hits with an E value of less than or equal to 10" 8 were recorded as homolog hits. 

Sequences were then reclustered using BLASTn and Cross-Match, a program for rapid 
protein and nucleic acid sequence comparison and database search (Green, supra ), sequentially. 
Any BLAST alignment between a sequence and a consensus sequence with a score greater than 150 
was realigned using cross-match. The sequence was added to the bin whose consensus sequence 

25 gave the highest Smith-Waterman score (Smith, supra) among local alignments with at least 82% 
identity. Non-matching sequences were moved into new bins, and assembly processes were 
performed for the new bins. ^ 

IV Coexpression Analyses of Known Kidney Disease- Associated Genes 

Eleven known kidney disease-associated genes were selected to identify novel genes that are 
30 closely associated with diseases of the kidney. These known genes were uromodulin, NKCC2, 
NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, NPT-1, calbindin, kininogen, andCIC-Kb. The 
kidney disease- associated genes which were examined in this analysis and brief descriptions of their 
functions are listed in Table 3. 

Table 3. Descriptions of Known Kidney Disease-Associated Genes 
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GENE E 
1 

Uromddulin 11 
r 
e 


F.SCRIPTION ANU Kiirr.i\J^i> ■ I 

_nm-Horsfall glycoprotein; immuno-suppressive; binds IL- la, 0, TNF 
narker for distal tubular function; marker for chrome f^^J" 
aplastic) nephrotoxicity, implicated in kidney stone ormauon (Won 
* al (1997) Br. J. Urol. 80:533-538; MacLeanet _ (1998) Cancer 
-htnother Pharmacol 41:413-416; Torffvitet al. (1998) Nephron 
79:167-172) . . 


I i 

NKCC2 


Bumetanide-sensitive Na (K>C1 2 co-transporter; mutant NKCC2 leads 
to neonatal Bartter's syndrome (hypokalemic alkalosis, low blood 
nriTure hvoercalcinuria); implicated in autosomal dominant polycystic 
S7«Sm* Nat. Genet. 13:183-188; Sulhvan et 
al (1998) Physiol. Rev. 78:1165-1191) _________ 


NCCT 


ThiaTirlp sensitive Na (K*)-C1, co-transporter; mutant NCCT leads to 
Gitelman syndrome (hypokalaemic alkalosis, hypotension, 
hypocalcinuria) (Mastroianni et _ (1996) Am, J. Hum. Genet. 
59:1019-1026) 


Aldolase B 


-„.otn«. hi^nhosnhate aldolase; mutation in aldolase B leads to fructose 
86 25-30; Ali et al. (1998) J. Med. Genet. 35:353-365) 


ROMK1 


t ,„orHiv r^rtifvins voltaee-gated K + channel; mutaUon in ROMK1 leads 
^SSSfiS&V**** (1994) Mc, P = 
45:854-860; Derst et al. (1997) Biochem. Biophys. Res. Commun 
230:641-645) ■ 


ATP1G1 


Na7K + -ATPase gamma subunit; required for acquisition of fluid 
transport capacity (Jones et al (1997) J. Cell Jfcol 139.1545-1552, Kim 
etal (1997) Biochim. Biophys. Acta 1350:133-135) 


[pDZKl 


PDZ domain containing protein; co-localizes with ion channels (Keener 
gal. (1998) Lab. Invest. 78:117-125) 


NPT-1 


' Na'-dependent phosphate co-transporter; candidate for heredity 
hypophosphatemia with hypercalcemia (Miyamoto et _ (1995) 

305:81-85; Miyamotoetal. (1995) Tokushima J. Exp. Med. 42:5-9) 


) Calbindin 


" rairium bindine Drotein, vitamin D-dependent; excess protein levels in 

(1996) Kidney Int. Suppl. 55S.166-168; Aicher et _ (1998) 
Electrophoresis 19:1988-2003) 


Kininogen 


" Cysteine protease inhibitor; regulates blood pressure in conjunction with 
renal kallikrein (Sharma et al. (1996) Gen. Pharmacol. 27:55-63) 


CIC-Kb 


Chloride channel; mutations lead to classic B artter 's syndrome 
(Rodriguez- Soriano (1998) Pediatr. Nephrol. 12:315-327) 



From a total of 41,419 assembled gene sequences, we have identified nine novel genes that 



20 



WO 00/61622 



PCT/USOO/08260 



show strong association with 1 1 known kidney disease-associated genes. Initially, the degree of 
association was measured by probability values using a cutoff p value less than 0.00001 . The 
sequences were further examined to ensure that the genes that passed the probability test had strong 
association with known kidney disease-associated genes. The process was reiterated so that the 
: reduced to the final nine kidney disease genes. Details of the expression 



5 initial 41 ,419 genes were i 
patterns for the 1 1 known 



and nine novel kidney disease genes are presented in Table 4. 



4. Co-Expression of Nine Novel Genes and 11 Known Kidney Disease Genes 



10 



15 



Table 

H°BP) — 

Name 1 2 3 4 5 6 7 8 

19 



20 



9 10 11 12 13 14 15 16 17 18 



25 



30 



1 Aldolase B 

2 PDZK1 

3 ATP1G1 

4 Kininogen 

5 NKCC2 

6 NCCT 

7 NPT-1 

8 Uromodulin 

9 ROMK1 
lOCIC-Kb 

11 Calbindin 

12 2675922 

13 3534377 

14 3481942 

15 2911650 

16 1900433 

17 2580580 

18 2673009 

19 1399234 

20 1867351 



9 

10 13 

21 9 6 

7 10 7 6 

5 9 10 6 10 

1 7 6 4 0 1 

9 13 12 7 12 13 

6 8 7 9 6 7 



4 
4 
3 
1 
4 
3 
5 
6 
1 



6 6 

7 4 

5 5 

6 5 

7 6 
9 4 

6 5 
9 7 

12 10 

8 5 

7 3 



5 

5 

4 

7 

5 

2 

5 

3 

5 

7 

2 



6 

8 

7 

8 

4 

7 

8 

3 

7 

7 

1 



9 

7 

7 

8 

5 

6 

9 

5 

7 

6 

1 



1 

0 

1 

0 

0 

3 

3 

0 

6 

4 

1 

6 



9 

8 

8 

6 

7 

5 

8 

7 

5 

7 

8 

3 



8 

5 

3 

6 

8 

3 

7 

5 

6 

8 

3 



4 
4 
5 
4 

3 

10 
3 
5 
4 
1 



4 
5 
6 
5 
5 
3 
6 
5 
1 



6 

3 6 

1 1 
5 6 

2 3 
7 8 

4 6 
1 1 



4 
3 
4 

2 
2 



3 
6 
4 
1 



5 

3 2 



We examined genes that are coexpressed with the 1 1 known kidney disease-associated 
genes, and identified nine novel genes that are strongly coexpressed. Each of the nine novel genes is 
coexpressed with at least one of the 1 1 known genes with a p-value of less than 1 0e-05 . The 
coexpressionoftheninenovelgeneswiththell known genes are shown in Table 4. Theentriesin 
35 T able4arethenegadvelogofthe^^ TbcBOWd 
genes identified are listed by their Incyte clone numbers, and the known genes, by their names or 
abbreviauonsasshowninTable3. For convenience, all the genes in Table 4 are assigned an 

identifying number, 1 to 20. 
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V Novel Genes Associated with Kidney Diseases 

Using the co-expression analysis method, we have identified nine novel genes that exhibit 
strong association, or co-expression, with 11 known kidney disease-associated genes. 

Nucleic acids comprising the consensus sequences of SEQ ID NOs:l-9 of the present 
5 invention were first identified from Incyte Clones 2675922, 3534377. 3481942. 2911650, 1900433, 
2580580, 2673009. 1399234. and 1867351. respectively, and assembled as described in Example 
III BLAST was performed for SEQ ID NOs:l-9 as described in Example VI. SEQIDNOs:l-9 
were translated and sequence identity was sought via motif searches and BLAST comparison to 
known sequences. SEQ ID NOs:10-12 of the present invention were encoded by the nucleic acids of 
10 SEQ ID NO:3. SEQ ID NO:5. and SEQ ID NO:7. respectively. SEQ ID NOs:10-12 were also 
analyzed using BLAST and other motif search tools as disclosed in Example VI. Analyses of the 

novel genes is as follows. 

SEQ ID NO:l (Incyte clone 2675922) is 613 nucleotides in length and has no significant 
knownhomologsinanyofttepublic^ SEQ ID NO:2 (Incyte 

15 clone 3534377) is 618 nucleotides in length and has no significant known homologs in any of the 
public databases described in this application. SEQ ID NO:3 (Incyte clone 3481942) is 512 
nucleotidesin length and has about 71% identity to the nucleic acid sequence of a rat 
Kidney-specific protein ( g 3127192). SEQ ID NO:4 (Incyte clone 2911650) is 999 nucleotides in 
length and has no significant known homologs in any of the public databases described in this 
20 application. SEQ ID NO:5 (Incyte clone 1900433) is 616 nucleotides in length and has about 41 % 
identity to the nucleic acid sequence of a human MAT8 protein (gl085025). SEQ ID NO:6 (Incyte 
clone 2580580) is 862 nucleotides in length and has no significant known homologs in any of the 
public databases described in this application. SEQ ID NO:7 (Incyte clone 2673009) is 1090 
nucleotides in length and has no significant known homologs in any of the public databases 
25 describedinthisapphcauon. SEQ ID NO:8 (Incyte clone 1399234) is 1108 nucleotides in length 
andhas no significant known homologs in any of thepublic databases described in this application. 
SEQ ID NO-9 (Incyte clone 186735 1) is 1290 nucleotides in length and has no significant known 
homologs in any of thepublic databases described in this application. SEQ ID NO:10 has 139 
amino acids which are encoded by SEQ ID NO:3 and has about 86% identity to the amino acid 
30 sequenceofaratkidney-specificprotein ( g3127193). Motif analyses of SEQ ID NO: 10 show two 
potentia.phosphorylationsitesatresiduesS19andTI37. SEQ ID NO: 11 has 89 amino acids 
which are encoded by SEQ ID NO:5 and has about 71% identity to the amino acid sequence of a 
mouse MAT8 protein (g 1085068). Motif analyses of SEQ ID NO: 1 1 show three potential 
phosphorylation sites at residues T15. S58. and S66. as well as a signal peptide sequence from 
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residues Ml through A19. and a MAT8 family signature sequence from residue D28 through Y63. 
SEQ ID NO:12 has 285 amino acids which are encoded by SEQ ID NO:7. Motif analyses of SEQ 
ID NO:12 show nine potential phosphorylation sites at residues S27, T43, S67, T72, S87, T160. 

Y179,T227,andS236. 
5 VI Homology Searching for Kidney Disease Genes and Their Encoded Proteins 

The polynucleotide sequences, SEQ ID NOs: 1-9, and polypeptide sequences, SEQ ID 
NOs:10-12, were queried against databases derived from sources such as GenBank and SwissProt. 
These databases, which contain previously identified and annotated sequences, were searched for 
regions of similarity using BLAST (Altschul, supra). BLAST searched for matches and reported 
10 only those that satisfied the probability thresholds of 10"* or less for nucleotide sequences and 10* 
or less for polypeptide sequences. 

The polypeptide sequences were also analyzed for known motif patterns using MOTIFS. 
SPSCAN BLIMPS, and HMM-based protocols. MOTIFS (Genetics Computer Group, Madison 
Wl) searches polypeptide sequences for patterns that match those defined in the Prosite Dictionary 
15 of Protein Sites and Patterns (B airoch. supra) and displays the patterns found and their 

corresponding literature abstracts. SPSCAN (Genetics Computer Group) searches for potential 
signal peptide sequences using a weighted matrix method (Nielsen et al. (1997) Prot. Eng. 10:1-6). 
Hits with a score of 5 or greater were considered. BLIMPS uses a weighted matrix analysis 
algorithm to search for sequence similarity between the polypeptide sequences and those contained in 
20 BLOCKS, a database consisting of short amino acid segments, or blocks of 3-60 amino acids in 
length, compiled from the PROSITE database (Henikoff. supra; Bairoch. supra), and those in 
PRINTS, a protein fingerprint database based on nonredundant sequences obtained from sources 
such as SwissProt, GenBank, PIR, and NRL-3D (Attwood et al. (1997) J. Chen, Inf, Comput. Sci. 
37-417-424). For the purposes of the present invention, the BLIMPS searches reported matches 
25 wimacutoffscoreoflOOOor^ HMM-based 
protocols were based on a probabilistic approach and searched for consensus primary structures of 
gene families in the protein sequences (Eddy, supra; Sonnhammer. supra). More than 500 known 
protein families with cutoff scores ranging from 10 to 50 bits were selected for use in this invention. 
VII Labeling of Probes and Hybridization Analyses 
30 Blotting 

Polynucleotide sequences are isolated from a biological source and applied to a solid matrix 
(a blot) suitable for standard nucleic acid hybridization protocols by one of the following methods. 
A mixture of target nucleic acids is fractionated by electrophoresis through an 0.7% agarose gel in 
lx TAE [40 mM Tris acetate. 2 mM ethylenediamine tetraacetic acid (EDTA)] running buffer and 
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transferred to a nylon membrane by capillary transfer using 20x saline sodium citrate (SSC). 
Alternatively, the target nucleic acids are individually ligated to a vector and inserted into bacterial 
host cells to form a library. Target nucleic acids are arranged on a blot by one of the following 
methods. In the first method, bacterial cells containing individual clones are robotically picked and 
5 arranged on a nylon membrane. The membrane is placed on bacterial growth medium, LB agar 
containing carbenicillin, and incubated at 37'C for 16 hours. Bacterial coloies are denatured, 
neutralized, and digested with proteinase K. Nylon membranes are exposed to UV irradiation in a 
STRATALINKER UV-crosslinker (Stratagene, La Jolla CA) to cross-link DNA to the membrane. 
In the second method, target nucleic acids are amplified from bacterial vectors by thirty 
10 cycles of PCR using primers complementary to vector sequences flanking the insert. Amplified 
target nucleic acids are purified using SEPHACRYL-400 (Amersharn Pharmacia Biotech). Purified 
target nucleic acids are robotically arrayed onto a glass microscope slide; The slide was previously 
coated with 0.05% aminopropyl silane (Sigma-Aldrich, St. Louis MO) and cured at 1 10°C. The 
arrayed glass slide (microarray) is exposed to UV irradiation in a STRATALINKER 
15 UV-crosslinker (Stratagene). 
Probe Preparation 

cDN A probe sequences are made from mRNA templates. Five micrograms of mRNA is 
mixed with lpg random primer (Life Technologies), incubated at 70°C for 10 minutes, and 
Iyophilized. The Iyophilized sample is resuspended in 50 pi of lx first strand buffer (cDNA 

20 Synthesis system; Life Technologies) containing a dNTP mix, [a- 32 P]dCTP, dithiothreitol, and 
MMLV reverse transcriptase (Stratagene), and incubated at 42°C for 1-2 hours. After incubation, 
the probe is diluted with 42 pi dH 2 0, heated to 95°C for 3 minutes, and cooled on ice. mRNA in the 
probe is removed by alkaline degradation. The probe is neutralized, and degraded mRNA and 
unincorporated nucleotides are removed using a PROBEQUANT G-50 microcolumn (Amersharn 

25 Pharmacia Biotech). Probes can be labeled with fluorescent markers, Cy3-dCTP or Cy5-dCTP 
(Amersharn Pharmacia Biotech), in place of the radionuclide, [ 32 P]dCTP. 
Hybridization 

Hybridization is carried out at 65°C in a hybridization buffer containing 0.5 M sodium 
phosphate (pH 7.2), 7% SDS, and 1 mM EDTA After the blot is incubated in hybridization buffer 
30 at 65°C for at least 2 hours, the buffer is replaced with 10 ml of fresh buffer containing the probe 
sequences. After incubation at 65°C for 18 hours, the hybridization buffer is removed, and the blot 
is washed sequentially under increasingly stringent conditions, up to 40 mM sodium phosphate, \% 
SDS, 1 mM EDTA at 65°C. To detect signal produced by a radiolabeled probe hybridized on a 
membrane, the blot is exposed to a PHOSPHORIMAGER cassette (Amersharn Pharmacia Biotech), 
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and the image is analyzed using IMAGEQUANT data analysis software (Amersham Pharmacia 
Biotech). To detect signals produced by a fluorescent probe hybridized on a microarray, the blot is 
examined by cbnfocal laser microscopy, and images are collected and analyzed using GEMTOOLS 
gene expression analysis software (Incyte Pharmaceuticals). 
5 VIII Production of Specific Antibodies 

SEQ ID NOs:10-12, or portions thereof, substantially purified using polyacrylamide gel 
electrophoresis or other purification techniques, is used to immunize rabbits and to produce 
antibodies using standard protocols as described in Pound (supra). 

Alternatively, the amino acid sequence is analyzed using LASERGENE software 
10 (DNASTAR. Madison WI) to determine regions of high immunogenicity, and a corresponding 
oligopeptide is synthesized and used to raise antibodies by means known to those of skill in the art. 
Methods for, selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic 
regions are well described in the art. Typically, oligopeptides 15 residues in length are synthesized 
using an ABI 431 A Peptide synthesizer (PE Biosystems) using Fmoc-chemistry and coupled to 
15 keyhole limpet hemocyanin (KLH, Sigma-Aldrich) by reaction with 

N-maleimidobenzoyl-N-hydroxysuccinimide ester (Ausubel, supra) to increase immunogenicity. 
Rabbits are immunized with the oligopeptide-KLH complex in complete Freund's adjuvant. 
Resulting antisera are tested for antipeptide activity by, for example, binding the peptide to plastic, 
blocking with 1 % BS A, reacting with rabbit antisera, washing, and reacting with radioiodinated 
20 goat anti-rabbit IgG. 
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What is claimed is: 

1 . A substantially purified polynucleotide comprising a gene that is coexpressed with one or 
more known kidney disease-associated genes in a plurality of biological samples, wherein each 
known kidney disease-associated gene is selected from the group consisting of uromodulin, NKCC2 

5 (bumetanide-sensitive Na-K-CI cotransporter 2), NCCT (thiazine-sensitive Na-Cl cotransporter), 
aldolase B. ROMK1 (inwardly-rectifying voltage-gated K channel), ATP1G1 (Na-K ATPase 
gamma subunit), PDZK1 (PDZ domain-containing protein), NPT-1 (Na-dependent phosphate 
cotransporter), calbindin, kininogen, and CIC-Kb (chloride channel). 

2. The polynucleotide of claim 1, comprising a polynucleotide sequence selected from the 

10 group consisting of: 

(a) a polynucleotide sequence selected from the group consisting of SEQ ID NOs:l-9 

(b) a polynucleotide encoding a polypeptide sequence selected from the group consisting of 
SEQ ID NOs:10-12; 

(c) a polynucleotide sequence having at least 75% identity to the polynucleotide sequence of 

15 (a) 
or(b); 

(d) a polynucleotide sequence which ^complementary to the polynucleotide sequence of (a), 

(b) 

or (c); 

20 (e) a polynucleotide sequence comprising at least 18 sequential nucleotides of the 

polynucleotide sequence of (a), (b), (c), or (d); and 

(f) a polynucleotide which hybridizes under stringent conditions to the polynucleotide of (a), 

(b) , 

(c) , (d), or (e). 

25 3. A substantially purified polypeptide comprising the gene product of a gene that is 

coexpressed with one or more known kidney disease-associated genes in a plurality of biological 
samples, wherein each known kidney disease-associated gene is selected from the group consisting of 
uromodulin, NKCC2, NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, NPT-1, calbindin, 
kininogen, and CIC-Kb. 

30 4. The polypeptide of claim 3, comprising a polypeptide sequence selected from the group 

consisting of. 

(a) the polypeptide having the amino acid sequence selected from the group consisting of 
SEQ ID NOs:10-12; 

(b) a polypeptide sequence having at least 85% identity to the polypeptide sequence of (a); 
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and 

(c) a polypeptide sequence comprising at least 6 sequential amino acids of the polypeptide 

sequence of (a) or (b). 

5 . An expression vector comprising the polynucleotide of claim 2. 
5 6. A host cell comprising the expression vector of claim 5. 

7. A pharmaceutical composition comprising the polynucleotide of claim 2 in conjunction 
with a suitable pharmaceutical carrier. 

8. A pharmaceutical composition comprising the polypeptide of claim 3 in conjunction with 

a suitable pharmaceutical carrier. 
10 9. An antibody or antibody fragment comprising an antigen binding site, wherein the * 

antigen 

binding site specifically binds to the polypeptide of claim 4. 

10. An immunoconjugate comprising the antigen binding site of the antibody or antibody 
fragment of claim 9 joined to a therapeutic agent. 

15 1 1 . A method for diagnosing a disease or condition associated with the altered expression of 

a gene that is coexpressed with one or more known kidney disease-associated genes, wherein each 
known kidney disease-associated gene is selected from the group consisting of uromodulin, NKCC2, 
NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, NPT-1, calbindin, kininogen, and CIC-Kb, the 
method comprising the steps of: 

20 (a) providing a biological sample; 

(b) hybridizing a polynucleotide of claim 2 to the biological sample under conditions 
effective to form one or more hybridization complexes; 

(c) detecting the hybridization complexes; and 

(d) comparing the levels of the hybridization complexes with the level of hybridization 

25 complexes in a non-diseased sample, wherein the altered level of hybridization complexes compared 
with the level of hybridization complexes of a nondiseased sample correlates with the presence of the 
disease or condition. 

12. A method for treating or preventing a disease associated with the altered expression of a 
gene that is coexpressed with one or more known kidney disease-associated genes in a subject in 

30 need, wherein each known kidney disease-associated gene is selected from the group consisting of 
uromodulin, NKCC2, NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, NPT-1, calbindin, 
kininogen, and CIC-Kb, the method comprising the step of administering to the subject in need the 
pharmaceutical composition of claim 7 in an amount effective for treating or preventing the disease. 

13. A method for treating or preventing a disease associated with the altered expression of a 
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gene that is coexpressed with one or more known kidney disease-associated genes in a subject in 
need wherein each known kidney disease-associated gene is selected from the group consisting of 
uromodulin, NKCC2, NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, NPT-1, calbindin, 
kininogen, and CIC-Kb, the method comprising the step of administering to the subject in need the 
5 pharmaceutical composition of claim 8 in an amount effective for treating or preventing the disease. 

14. A method for treating or preventing a disease associated with the altered expression of a 
gene that is coexpressed with one or more known kidney disease-associated genes in a subject in 
need wherein each known kidney disease-associated gene is selected from the group consisting of 
uromodulin, NKCC2, NCCT, aldolase B, ROMK1, ATP1G1, PDZK1, NPT-1. calbindin, 

10 kininogen. and CIC-Kb, the method comprising the step of administering to the subject in need the 
antibody or the antibody fragment of claim 9 in an amount effective for treating or preventing the 
disease 

15. A method for treating or preventing a disease associated with the altered expression 
gene that is coexpressed with one or more known kidney disease-associated genes in a subject in 

15 need, wherein each known kidney disease-associated gene is selected from the group consisting of 
uromodulin, NKCC2, NCCT. aldolase B, ROMK1, ATP1G1, PDZK1. NPT-1, calbindin, 
kininogen, and CIC-Kb, the method comprising the step of administering to the subject in need the 
immunoconjugate of claim 10 in an amount effective for treating or preventing thedisease. 

1 6. A method for treating or preventing a disease associated with the altered expression of a 
20 gene that is coexpressed with one or more known kidney disease-associated genes in a subject in 

need, wherein each known kidney disease-associated gene is selected from the group consisting of 
uromodulin, NKCC2, NCCT, aldolase B, ROMK1 , ATP1G1, PDZK1. NPT-1, calbindin, 
kininogen, and CIC-Kb, the method comprising the step of administering to the subject in need the 
polynucleotide sequence of claim 2 in an amount effective for treating or preventing the disease. 
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SEQUENCE LISTING 



<110> INCYTE PHARMACEUTICALS , INC. 
WALKER, Michael, G. 
VOLKMUTH, Wayne 
KLINGLER, Tod, M. 
AZIMZAI, Yalda 
YUE, Henry 

<120> GENES ASSOCIATED WITH DISEASES OF THE KIDNEY 
<130> PB-0010 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 09/289,349 
<151> 1999-04-09 

<160> 12 

<170> PERL Program 

<210> 1 
<211> 613 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> incyte ID No.: 2675922CT1 



<400> 1 ^ mtm aettactacc tccagagaac cttccctggt gtggaggcag 60 

3= 3§| |fS i£i =3 SEE » 

e= ss lis 35 ssi ss = 

cccagcctgg tga 



<210> 2 

<211> 618 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> incyte ID No.: 3534377CT1 



SSS SSS55 ssss ssss fiiii S3 s 

taclaccact cttttccttg agactgcaaa gaaaatggtt ctatagtttg Jtggttcta 

£2233 Kaaaaaa ESSE tggctc £ 300 

SSaattcc agcactKgg gaggecaaga caggaggatt gcttgagccc aggaatttga 360 
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-^t-r-ht-t-at tttttatttt tcgacataga 420 

S5SKS SSES JSS U< £ 2SSS SS 
SSSS ™.« SSSJ. — ««- in 



ccatgttggc caggctgg 



<210> 3 
<211> 521 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No. 



3481942CB1 



<400> 3 

tctggagctg 

aggacttggc 

ggggtactca 
agtggggcca 
gggctgacat 
ggaaggggaa 
ccaacgtcct 
cccgagtgcc 
tgcctggaac 



ctggctggag 
aggcctgaac 
gatgtccagc 
ccaggaagtg 
ggagaaggct 
ggaattaatg 
ctcgggagcc 
tgagtggtgg 
catcagatga 



aggagggtgg 
atgcattggc 
cgcactctct 
ccggccaagt 
ggcaagcgac 
tggaatttca 
tgtggcctgc 
ctggtgatcc 
gatccactga 



acgaagctct 
tgcgaaaagt 
acattaatag 
ttaactttgc 
tcccaagccc 
gagaactgag 
agcgtgggga 
tgggctgcat 
catactgtat 



ctctagaaag 
tcagggactt 
taggcaactg 
tagtgatgtg 
agccctgtgg 
tgaaaacagc 
tcgtgtggca 
tcgagcaggg 



acatcctgag 60 
tgcaccctgt 120 
gtgtccctgc 180 
ttggatcact 240 
tgggtgaatg 3 00 
cagcaggcag 3 60 
gtggtgctgc 420 
ctcatcttta 480 
521 



<210> 4 

<211> 999 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature _^ m1 
<223> incyte ID No.: 2911650CT1 



c„ tg ^» g-SK SSSSS £ 

atgtctattt ggttctagtt ctataaaaat ^gaacattg gga g y aatctgg ata 180 
aaataaagtg gattcaggtt atcagtgaaa tgatggtate gat g tggctaaa at 240 
tatattgatg ttactgcatt tgtaaatgtc ctcccattaa acaga g tttgt 3 00 

ccctgg?ctg gaactgaact agtttaaggc ^gtaaga Jjjggj J J ttMttatt 3 60 
gaaagtttaa agaatgccct caatccatga acgagaggga gga g tagatctcca 420 

tagaaggttt ttttttcctt gtctgttgat ""ttatgg 99 « tta ttatact 480 
ttccttaaaa aatattacat gtcagtgtag ttagatatta ggcatgttta 540 

tgctattttt actgtatagt gggtataat caacaaaau J J^J ctaattt 600 
caccttttct gttgcactat taaattcttt "acag tacaagg acgtagcata 660 

tgtgatgcca attaaaaatc ttgttcattt a ««gt atttggaaga 720 

agaLcaaaa aaaattttgt attgtcattt ^atatcaa gaaa tatcac 780 

SSSS £SS£ » = j-ss- - 
S2SSS £2=5 PJE U s? ™ 5S 

agtcaacaaa tcatataata tgaaataaat agctccggg 



<210> 5 

<211> 616 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature <10AftA „_ 1 
<223> Incyte ID No.: 1900433CB1 



gccagcU f^cccte gccaa ggtga cctcjcjjg ejctgjtjjj gjjcjjtgj JO 
ggaacctgca gagtcacaca |"gctgacc aattgagctg J^W gaactgtgac 180 
gggctgcaga cccccgcccc jgtgcctcte «cc^g tgactgcct t ggaagccaat 240 
atggagagag tgaccctggc ccttctccca ctgg gg gaaaaacct gcagctgagc 300 
gacccatttg ccaataaaga <=gatcccttc Jactatgact f gagtggcaaa 360 

ggactgatct gcggagggct cctggccatt ^tgggatcg egg | cat cccactc 420 
Scaaataca agagcageca gaagcagcac jgtcctfltac 9 9 g atgg cctga 480 
atcactccag gctctgccac tacttgetga ^caggact OT agg aaggact 540 

SEKS SS SSSSS StSLSg aggcttcttt atgaattaaa 600 



ctcgccccac cacccc 



<210> 6 
<211> 862 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 

<223> incyie ID NO.: 2580580CT1 



<400> 6 tttatcaggg atctggcaac gtagaagaaa gaacaaagaa 60 

gttgcaattg cactactgat tttaccaggg au ya taatcacaat tgaaaatggc 120 
cca?ctgaag tggatgaege tgaagataag tgtgaaaaca tgatcac a gag 180 
atcccctctg atcccctgga catgaaggga Jtcctcaaga aattaaacat 240 

gatgagaggc tcacccctct ctgaagggct Jttgttctgc ttcctc ^ ^ ttt 300 
ttgtttctgt gtgactgctg agcatcctga ^taccaaga gc 9 aatC aattat 360 

caccattctt cttttgtaat aaattttgaa tgtgcttgaa 99 aaaat atttt 420 

acccaccaac accactgaaa tcataagcta "cacgactc J« tttaagcatt 480 

tctgacagta tagtgtataa atgtggtcat gggtatttg tagtta^ 
tttagaaata agatcaggca tatgtatata "ttca tcattgaaaa tgga tccttt 600 
aaattttcca gtggagaata catataatat 99tg^y agtgag aa gtaattat 660 

ttgacgatca cttatatcac tctgtatatg actaagtaaa caaaag gg ^ t 720 
tgtaaatgga tggataaaaa jBgagact catatacagg 9^ aattctattt 780 

SSSK SSSSS S£SS SStAct aacttaataa agtaataatc 840 



atctcttaaa aaaaaaaaaa aa 



<210> 7 

<211> 1090 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> incyte ID No.: 2673009CB1 



<400> 1 

tgccacctga 

ggtctaccga 

caegtcaggt 

gacagtggac 

gacagtcatg 

agatttcccc 

agacaaggac 

gtteggggag 



gcgccgctcc 
cctgatgtgg 
cccctcctgg 
ttcgtcagga 
gaggccgtgg 
aactccttcc 
tggttccacc 
ccccagtggg 



ctctcaggat 
acccagaggt 
accgtgtctt 
geaagcatge 
acctgctgga 
atgccttcca 
tegteggget 
ctgtcgtcgg 



gaaggtgacg 
ggecaaagae 
caccacctac 
ccagtttggg 
tgggctggtg 
gaeageggag 
cctgcacgac 
cgacaccttc 



gtgggcccag 
aaggecaget 
aagctcatgc 
ggcttctcct 
gatgagtegg 
ggcatccgga 
ctggggaagg 
cccgtcggat 



acccttccct 60 
teeggaacta 120 
acacgcacca 180 
acaagaaaat 240 
acccggacgt 300 
aggcccaccc 3 60 
tcctggccct 420 
gccgtccgca 480 
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ggcctccgtg gttttctgcg actccacctt 
atacagcaca gagctcggga tgtatcagcc 
ctggggccat gatgagtaca tgtaccaggt 
tgaggctttc tacatgatcc ggttccactc 
ccagcagctg tgcagccagc aggacctggc 
gttcgacctc tacaccaagt gcccggacct 
ccaggggctc attgacaagt actgccctgg 
aagctgctgc tggacctagg cctggccctc 
aacagccgcc atcagggttc acctcggtgg 
cccctcacgg caacttgtgc ctggcgtcaa 
aaaaaaaaaa 



ccaggacaac cctgacctcc aggatcctcg 540 
ccactgtggg ctcgacaggg tcctcatgtc 600 
gatgaagttt aacaagttct cactgccccc 660 
cttctacccc tggcacacgg gccgcgacta 720 
catgctgccc tgggtgcggg agttcaacaa 780 
gccggacgtg gacaagctgc ggccctacta 840 
catcctgagc tggtgaccct cctgccaccc 900 
cgcctgcctg gagaggcctg gccctgggca 960 
gggaccccac tcaccccctt agggtcgcca 1020 
taaagacctg gaaggatgtt gtgcttctga 1080 



<210> 8 
<211> 1108 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 1399234CT1 

<400> 8 



aaactaacct gcccaggacg ccctgggaac agggggcgga gctccggcta gaggggactc 60 
ctgtg?gggc gggcaggtc? ggagcctggg cccaggtgtg gcctgctggg ^cagctcc 120 
gggggagagc ?g?ggggaga acagcggtct agggaggcag gaggcaggcg ^aggctcct 180 
ccccagcagc ctgcagggca tcaggacaga gggggaagga ggagacagaa gggaaagaag 240 
SSSi ggSaaSSig aaacagcgcg gcttctgctg -jccagtac Wggg JJO 

ss? SS 2SS * « SSSSS «a? gg ?i „. 

ctcc?tggcg cfgtccccag ccccacaggc ctgctgcggc catccgcaaa cgggcatttc 480 
cttSaacttt tgagagtgcc tttgaccagc tgttcgcagc tgggaaactc caacttttga 540 

3=5K SSSS SSSS SSSS H3K 

Hba sfe s:s ssss ss=s ssss 

caaSggtS ccaagaaaga fgatacctct gcattttctg gtggaaaagg tgtaataccc 840 
ttaatgfgat caaagtgt?a ggggaaaaaa attccaaaag tagttacaag ^atatcaat 900 
^raaacrtaa atcrcacttca tcaaagctaa gaagtcacag gaattgttct caggttttta »bu 
laaaaKS? tcctgaattc aggaagtgtc ttctgaatag cagctagcca aataaagcgg 1020 
tgtgtgtgta ctgcagctgt aggtgaactt aaaaataata ataaaaagaa caaataaagc 1080 
agtgtgtacc agccaaaaaa aaaaaaaa 



<210> 9 
<211> 1290 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 1867351CB1 



<400> 9 

tctctagctc ctgggagagt 
gtgttcgtga aggagtgagt 
tgtagcaata gtgtcctgaa 
atagcccttt ataaaaatga 
tcccagggtg ccctgaaaac 
agcctcagtc agatcagcag 
gcgctaaatt gtccaggaaa 
tcttaatctt gctggtgggg 
ggaaagaaag tcaggtcaag 



gctccctgga gcacccaagc 
ggaggggcac tctgcgataa 
ttaactttct ctccactatt 
aaaaaaaaaa tcccttttaa 
cttgaaggaa ttttaacgaa 
atgcgccaag aagaaccttt 
cagctggtgc tgcggcgagg 
attttagtga gattctatgt 
tgatgctttt gagcttacac 



cagcgctggt gggatcatcg 60 
gatttctttt atttattcac 120 
agcacatatt ccttttatac 180 
gctgctaagt tcctttttcc 240 
cgatgttgga aagacaggcg 300 
gccggaacat ccacaggacg 360 
gcttctgctc ctgggggtct 420 
cagaattcag tgacgtggta 480 
acaattcaca ggcccaccag 540 
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tgacaattta ctgtgagtta atgtcattca 
tgcaaagaca catttttcta taaaaagaaa 
cccaagacac tgtctgaaag atgacatgag 
ggatcaatgt gctgactgca ttggccaatg 
caaacccata aactaactgc ttaagaattc 
acagtatatc tttccttggg ccttagatta 
tttttttttt tttttttaat agacggaagt 
ggtgcgatca tagctcactg cagcctcgaa 
agccaccaga gtagctgaga ctacaggggt 
tcttcatttg aggtctagat ctaggcactg 
cgagctgtgg aaatccaaac aaagactgat 
ctgcagcctc aacctcctgg gctcaagtga 
gaccacaggc gtgtgcccac acgcctagct 
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ggtgtgccca tggattttga gggctggaaa 600 
aagcaactaa ggttaaaagc tatattgtgg 660 
tagtaattca ccactatctg aaccaagcaa 720 
gctttgatac ttctgctatt tttttagaca 780 
atactgcttg aattatgtaa aatatatttt 840 
ctattcactg ggcaaatggt atttgttttt 9 00 
cttgctctgt catgcaggct ggagtgcggt 960 
ctcttgggct tcaagcaatc ctcctgtgtc 1020 
atgccaccat gcccagctgg catttgttaa 1080 
tggacactga aaaacagttg ggaaatcttt 1140 
aattcctggt aggggtgtgt gcgtgacgta 1200 
tcctcccacc tcagcctcct gagtagctga 1260 



<210> 10 

<211> 139 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 3481942CD1 

Me^isVp Leu Arg Lys Val Gin Gly Leu Cys Thr Leu Trp Gly 

Thr Gin Met Ser Ser Arg Thr Leu Tyr lie Asn Ser Arg Gin Leu 

20 25 
Val Ser Leu Gin Trp Gly His Gin Glu Val Pro Ala Lys Phe Asn 

Phe Ala Ser Asp Var Leu Asp His Trp Ala Asp Met Glu Lys Ala 

50 55 
Gly Lys Arg Leu Pro Ser Pro Ala Leu Trp Trp Val Asn Gly Lys 

Gly Lys Glu Leu Met Trp Asn Phe Arg Glu Leu Ser Glu Asn Ser 



30 

Gin Gin Ala Ala Asn Val Leu Ser Gly Ala cys Gly Leu Gin Arg 

95 1°° 

Gly Asp Arg Val Ala Val Val Leu Pro Arg Val Pro Glu Trp Trp 

Leu Val lie Leu £J Cys lie Arg Ala Gly Leu lie Phe Met Pro 

125 130 1" 

Gly Thr lie Arg 



<210> 11 

<211> 89 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No.: 1900433CD1 

iTlluArg Val Thr Leu Ala Leu Leu Leu Leu Ala Gly Leu Thr 



Ala Leu Glu Ala Asn Asp Pro Phe Ala Asn Lys Asp Asp Pro Phe 

20 25 
Tyr Tyr Asp Trp Lys Asn Leu Gin Leu Ser Gly Leu He Cys Gly 



5/6 



PCT/US00/08260 

WO 00/61622 

Gly Leu Leu Ala III Ala Gly He Ala Ala Val Leu Ser Gly Lys 

50 ^5 
cys Lys Tyr Lys Ser Ser Gin Lys Gin His Ser Pro Val Pro Glu 

Lys Ala He Pro Leu lie Thr Pro Gly Ser Ala Thr Thr Cys 
80 85 



<210> 12 
<211> 285 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No.: 2673009CD1 

<400> 12 



Met Lys Val Thr Val Gly Pro Asp Pro Ser Leu Val Tyr Arg Pro 
Asp Val Asp Pro Glu Val Ala Lys Asp Lys Ala Ser Phe Arg Asn 



20 



Tyr Thr Ser Gly Pro Leu Leu Asp Arg Val Phe Thr Thr Tyr Lys 
* 35 40 " 

Leu Met His Thr His Gin Thr Val Asp Phe Val Arg Ser Lys His 

50 " 
Ala Gin Phe Gly Gly Phe Ser Tyr Lys Lys Met Thr Val Met Glu 



65 



70 



Ala Val Asp Leu Leu Asp Gly Leu Val Asp Glu ser Asp Pro Asp 



80 

Val Asp Phe Pro Asn Ser Phe His Ala Phe Gin Thr Ala Glu Gly 



lie Arg Lys Ala His Pro Asp Lys Asp T^p Phe His Leu Val Gly 



100 



110 115 120 

Leu Leu His Asp Leu Gly Lys Val Leu Ala Leu Phe Gly Glu Pro 

125 l-* 0 
Gin Trp Ala Val Val Gly Asp Thr Phe Pro Val Gly Cys Arg Pro 

140 145 
Gin Ala Ser Val Val Phe Cys Asp Ser Thr Phe Gin Asp Asn Pro 
155 

Asp Leu Gin Asp Pro Arg Tyr Ser Thr Glu Leu Gly Met Tyr Gin 



170 

Pro His Cys Gly Leu Asp Arg Val Leu Met Ser Trp Gly His Asp 



175 



185 — 
Glu Tyr Met Tyr Gin Val Met Lys Phe Asn Lys Phe Ser Leu Pro 



190 



Pro Glu Ala Phe Tyr Met He Arg Phe His Ser Phe Tyr Pro Trp 

215 220 
His Thr Gly Arg Asp Tyr Gin Gin Leu cys Ser Gin Gin Asp Leu 

230 23i 
Ala Met Leu Pro Trp Val Arg Glu Phe Asn Lys Phe Asp Leu Tyr 



205 



Thr Lys cys Pro III Leu Pro Asp Val Asp Lys Leu Arg Pro Tyr 



Tyr Gin Gly Leu 5! Asp Lys Tyr Cys Pro Gly lie Leu Ser Trp 



265 



275 



280 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This international Search Report has not been established In respect of certain claims under Article 1 7(2)(a) for the following reasons: 

1. [x] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Although claims 12-16 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the alleged 
effects of the compound/composition. 

2. Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . I | As ail required additional search fees were timely paid by the applicant, this International Search Report covers all 
1 — 1 searchable claims. 

As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
1 — 1 covers only those claims for which fees were paid, specifically claims Nos.: 



4. |_yj No required additional search fees were timely paid by the applicant Consequently, this International Search Report i: 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

2, 4-7, 9-12, 14-16 all partially and 1,3,8 and 13 completely 



Remark on Protest j The additional search fees were accompanied by the applicant's protest. 

| | No Potest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 — — — - 



1. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.l that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide, an 
expression vector, a host cell, a pharmaceutical 
composition, an antibody or antibody fragment, and 
imnunoconjugate; a method for diagnosing the disease 
associated with such gene and a method of treatment for such 
disease. 



2. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.2 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide, an 
expression vector, a host cell, a pharmaceutical 
composition, an antibody or antibody fragment, and 
imnunoconjugate; a method for diagnosing the disease 
associated with such gene and a method of treatment for such 
di sease. 



3. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.3 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide 
(SEQ.ID.N.10), an expression vector, a host cell, a 
pharmaceutical composition, an antibody or antibody 
fragment, and immunoconjugate; a method for diagnosing the 
disease associated with such gene and a method of treatment 
for such disease. 



4. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 

A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.4 that is 
co-expressed wi th one or more known ki dney 
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disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide, an 
expression vector, a host cell, a pharmaceutical 
composition, an antibody or antibody fragment, and 
iriinunoconjugate; a method for diagnosing the disease 
associated with such gene and a method of treatment for such 
disease. 



5. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.5 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide 
(SEQ.ID.N.ll) , an expression vector, a host cell, a 
pharmaceutical composition, an antibody or antibody 
fragment, and immunocon jugate; a method for diagnosing the 
disease associated with such gene and a method of treatment 
for such disease. 



6. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.6 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide, an 
expression vector, a host cell, a pharmaceutical 
composition, an antibody or antibody fragment, and 
immunoconjugate; a method for diagnosing the disease 
associated with such gene and a method of treatment for such 
disease. 



7. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.7 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide 
(SEQ.ID.N.12) , an expression vector, a host cell, a 
pharmaceutical composition, an antibody or antibody 
fragment, and immunoconjugate; a method for diagnosing the 
disease associated with such gene and a method of treatment 
for such disease. 
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8. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.8 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide, an 
expression vector, a host cell, a pharmaceutical 
composition, an antibody or antibody fragment, and 
immunocon jugate; a method for diagnosing the disease 
associated with such gene and a method of treatment for such 
disease. 



9. Claims: 2, 4-7,9-12,14-16 all partially and 1,3, 
8 and 13 completely 



A substantially purified polynucleotide comprising a gene 
comprising a polynucleotide from SEQ.ID.N.9 that is 
co-expressed with one or more known kidney 
disease-associated genes in a plurality of biological 
samples, the corresponding purified polypeptide, an 
expression vector, a host cell, a pharmaceutical 
composition, an antibody or antibody fragment, and 
immunocon jugate; a method for diagnosing the disease 
associated with such gene and a method of treatment for such 
disease. 
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